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FIELD OF THE INVENTION 



The present invention relates generally to video 
conferencing systems, and, more particularly, to a 
peripheral video conferencing system adapted for stand- 
alone use and operation with a separate host computer 
20 system. 
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BACKGROUND OF THE INVENTION 

Video teleconferencing systems which employ 
custom audio and video processing components and 
proprietary signal processing techniques for effectuating 
video teleconferencing over a dedicated network link are 
known, custom prior art video teleconferencing systems, 
such as the one illustrated in Fig. l, typically employ a 
local video, processing system 20 and a remote video 
processing system 30 that exchange audio and video 
information over a dedicated or specialized network link 
26. Manufacturers of such prior art video processing 
systems typically utilize custom audio and video components 
m the design and manufacture of custom audio and video 



coder and decoder (CODEC) assemblies which generally 
communicate only with CODEC assemblies produced from the 
same manufacturer. Utilization ^of cus torn • CODEC assemblies 
typically necessitates employment of custom communication 
interfaces 24 and 34 to interface the custom video 
processing systems 20 and 30 with the dedicated network 
link 26. 

Employing custom audio and video CODEC assemblies 
and signal processing techniques in the production of video 
teleconferencing systems generally results in the 
manufacture of low-volume, high-cost systems that are 
typically affordable only to governmental entities and 
larger corporations. Early custom video processing systems 
20 and 30 typically sold for over $100,000 per individual 
system, with operational costs often exceeding $200 per 
hour to communicate over a dedicated network link 26. 

Recent advances in video teleconferencing 
technology have resulted in a reduction in the production 
and procurement costs of video teleconferencing systems. 
By 1993, two emerging leaders in the video teleconferencing 
system market, PictureTel and VTel, were marketing systems 
having an average purchase price of approximately $40,000. 
These and other prior art video teleconferencing systems, 
however, continue to employ custom audio/video components 
and signal processing techniques. The high costs 
associated with the purchase and operation of prior art 
video teleconferencing systems and, in particular, the lack 
of compatibility between dissimilar systems, severely 
limits .the efficacy of video teleconferencing as a 
communication tool for businesses and individuals. 

Desktop video teleconferencing systems have 
recently been developed to take advantage of the relatively 
low-cost processing capability of today's personal computer 
systems. Manufacturers of such desktop video 
teleconferencing systems generally produce and market a set 



of video teleconferencing computer cards which, together 
with the computer system into which the cards are 
installed, provide desktop video teleconferencing of 
limited quality and functionality. A video 
teleconferencing card set adapted for use in a host 
computer system typically comprises three individual 
computer cards, each of which must be installed into a 
corresponding card slot within the computer system chassis. 
A conventional personal computer system 40 is illustrated 
in Fig. 2 with its housing cover removed. A typical 
personal computer system 40 generally includes a power 
supply 42, one or more hard disk and floppy disk drives 44, 
internal random access memory, and up to eight card slots, 
all of which are generally connected to and communicate 
over a motherboard 51. 

A user of a prior art desktop video 
teleconferencing system must typically disassemble the 
computer system's housing to gain access to the internal 
components, install the. three video teleconferencing cards 
into three card slots 46, 48, and 50, modify the settings 
of various configuration switches on the motherboard 51, 
reassemble the computer system's housing cover and base, 
and then reconfigure the operating system software of the 
computer system 40 to acknowledge and communicate with the 
newly installed video teleconferencing card set. Although 
the cost of prior art desktop video teleconferencing 
systems are comparatively lower in cost than the video 
processing systems 20 and 3 0 previously discussed, such 
desktop systems continue to be prohibitively expensive to 
many businesses and the individual consumer. More 
significantly, a potential purchaser of a prior art desktop 
video teleconferencing system may be unable or unwilling to 
dedicate up to three internal card slots 45, 48, and 50 for 
the installation of the video teleconferencing cards. 
Moreover, the typically complex task of re-configuring both 



the hardware and the software of the personal computer 

cTrLtV?"^-:""'"^""" '""^ teleconferencing 
card set to facrUtate desktop video teleconferencing „ay 
well deter a user tram investing in such a curobersome 

3 system. 

1 lustratea rn Figs. 1 and 2 fail to provide a high degree 
Of portability from one conferencing site to another The 
system shown in Fig. 1 is usually permanently installed in 

10 dedicated local and remote conferencing rooms, where 
parties must convene in order to participate in a video 
teleconference. Access to the dedicated network link 26 is 
generally available only at the permanent conferencing 
srtes, thus precluding the ability to move the video 

15 teleconferencing systems 20 and 30 to other desired 
conferencing sites. 

Transporting the prior art desktop video " 
teleconferencing system illustrated in Fig. 2 to a new 
conferencing site is similarly impractical. The desktop 
20 computer system 40, keyboard, monitor, cables, and any 
other attached peripheral devices must be disconnect 
transported, then reconnected at the new conferencing site 
Although transporting a prior art desktop video 
teleconferencing system 40 and associated hardware can be 
■5 accomplished through great effort, such systems generally 
require a dedicated network link typically unavailable at 
other desirable conferencing sites. Further, use of custom 
audro/vrdeo components and signal processing techniques 
continues to severely limit the portability of prior art 
0 desktop video teleconferencing systems. 

There is a desire among the manufacturers and 
users of video teleconferencing systems to minimize the 
complexity of installing, configuring, and operating a * 
_ video teleconferencing system. There exists a further 
5 desire to enhance the portability of a video 



teleconferencing system to facilitate easy transport of the 
system to a plurality of conferencing sites. Additionally, 
there continues to exist in the video teleconferencing 
equipment manufacturing community a keenly felt need to 
provide full-color, full-motion video teleconferencing 
systems which can communicate in conformance with 
internationally recognized communication standards, and be 
purchased at a relatively low cost. The present invention 
fulfills these and other needs. 



SUMMARY OF THE INVENTION 



The present invention is a peripheral 
audio/visual communication system that communicates with 
analog and digital communication channels for transmitting 
video, audio, and other information acquired from a local 
conferencing site, and receiving audio and video 
information from a remote conferencing site. The invention 
also comprises a high-speed interface for communicating 
with a separate host computer system and includes visual 
conferencing application software to enhance the 
functionality of the audio/visual communication system. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of prior art video 
processing systems incorporating custom audio/video 
components and custom communication interfaces to transmit 
and receive^i-nf ormation over a dedicated network link; 

^x'^ig. 2 is a generalized top view of a prior art 
personal computer system with its housing cover removed and 
three prior art video teleconferencing cards installed in 
the computer system; 



^Fxg. 3 is an illustration of a video conferencing 
system employing a novel peripheral audio/visual 
c ommun i c a t i on,^-s'y s t em ; 

^^^Fig. 4 is an illustration of various system 
5 boards of a novel peripheral audio/visual communication 
system; 

Fig'. 5 is a system block diagram of a novel 
peripher^f audio/visual communication system coupled to a 
communication channel; 
iO ^- 'Fig, 6 is an illustration of various connectors, 

jacks, and transceivers comprising the output, input, and 
communication channel interfaces of a novel peripheral 
audio/visual communication system; 

^Ig. 7 is a diagrammatic illustration of a video 
15 conferencing system employing two novel peripheral 

audio/visual communication systems that provide video 
conferencing between local and remote conferencing sites 
over a communication channel; 

"^Fig. 8 is a block diagram of a video processing 

2 0 board suitable for processing local or remote video signals 

communicated to a novel peripheral audio/visual 
communication system; 

.r-'Fig. 9 is a block diagram of an audio processing 
board suitable for processing local and remote audio. 
25 signals communicated to a novel peripheral audio/ visual 
c ommuni c a t i<5ri sy s t em ; 

Fig. 10 is a generalized illustration of various 
informational fields comprising a read or write 
coordination instruction produced by a host computer system 

3 0 and accompanying visual conferencing application software 

operating thereon, and transmitted over a high-speed 
interface cojipling the separate host computer system and a 
novel gje-rl^heral audio/ visual communication system. 

Fig. 11 is a diagrammatic illustration of a video 
35 conferencing system configuration employing two novel 
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peripheral audio/visual communication systems, with 
associated host computer systems coupled thereto, 
cooperatively communicating to provide window sharing and 
document collaboration between a local and remote 
5 conf erencing^'site; 

/Fig. 12- is a generalized flow diagram 
illustrating various process steps associated with window 
sharing and document collaboration operations coordinated 
by visual conferencing application software operating on a 
10 host computer system in cooperation with a novel peripheral 
audio /visual communication system; 

^^..Fig. 13 is a generalized flow diagram 
illustrating various processing steps associated with the 
coordination of enhanced visual conferencing functions 
15 orchestrated by visual conferencing application software 
operating on a host computer system in cooperation with a 
novel peripheral audio/visual communication system; 

^ig. 14 is an illustration of one embodiment of a 
periphea?^ housing configuration for a novel peripheral 

2 0 audio/visual communication system; and 

Fig^f 15 is an illustration of a back panel of a 
peripheral housing configured with a plurality of input and 
output connectors, jacks, and transceivers for a novel 
peripheral audio/ visual communication system. 

25 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Referring now to the figures, and more 

3 0 particularly to Fig. 3, there is shown a video conferencing 

system including a novel peripheral audio/visual 
communication system 70 adapted for communicating source 
audio and video signals 80 and 78 over a communication 
channel 82, and receiving remote audio and video signals 
35 transmitted over the communication channel 82 from a remote 

7 



conferencing site. In one embodiment, the peripheral 
audio/visual communication system 70 is configured to 
transmit remote video image signals to an external monitor 
76 received from a remote conferencing site over the 
5 communication channel 82 and remote audio signals to an 
internal speaker 90. In another embodiment, the 
audio/visual communication system 70 is configured to 
communicate with a separate host computer system 72 over a 
high-speed host computer system output interface 140. 

10 Video images received from a remote conferencing site are 
displayable over a display 74 coupled to the host computer 
system 72, and remote audio signal are preferably broadcast 
^over the internal speaker 90. 

Upon comparing the novel audio /visual 

15 communication system 70 shown in Fig. 3 to the prior art 
video teleconferencing systems discussed previously with 
respect to Figs. 1 and 2, it will become readily apparent 
that the stand-alone, peripheral audio/visual communication 
system 70 provides for a substantial reduction in the cost 

2 0 and complexity of effectuating video conferencing. Most 

noticeably, the peripheral audio/visual communication 
system 70 provides full-color, full-motion video 
conferencing between local and remote conferencing sites 
simply by connecting the audio/visual communication system 
25 70 to a communication channel 82, a standard telephone line 
for example, a video signal source 78, such as a video 
camcorder, and an external monitor 76, such as a television 
monitor, for viewing remote video images. A built-in 
microphone 91 is adapted to receive local audio signals for 

3 0 communication to a remote conferencing site, and an 

internal speaker 9 0 is provided for broadcasting the 
received remote audio signal. It is noted that the 
audio/visual communication system 70, being a compact, 
stand-alone peripheral, is well suited for easy transport 
35 to any one of a plurality of desired conferencing sites. 
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One important advantage of the novel audio/visual 
communication system 70 is the effective uncoupling, or 
separating, of the various audio and video processing 
assemblies needed to effectuate video conferencing from a 
5 host computer system 72. As illustrated in Fig. 4, all 

audio and video processing boards and components comprising 
the audio/visual communication system 70 are housed in a 
peripheral housing 115 completely separate from the' host 
computer system 72. Accordingly, a user of the peripheral 

10 audio/visual communication system 70 need not install any 
additional cards into the host computer system 72, nor is 
it necessary for the user to reconfigure the hardware or 
operating system software of the host computer system 72 to 
effect high-quality, full-feature video conferencing. A 

15 user need only couple the novel peripheral audio/visual 

conferencing system 70 to a host computer system 72 using a 
standard connection interface. 

As further shown in Fig. 4, a housing cover 114 
is detachably secured to a housing base 116 that together 

20 comprise a peripheral housing 115 within which the 

components of the audio/visual communication system 70 are 
housed. An input/output board 112 preferably includes 
means for communicating between a motherboard 100 and 
various connectors, jacks, and transceivers, mounted on a 

25 back panel 123 of .the peripheral housing 115, audio and 
video signals communicated between a local and remote 
conferencing site. A video board 104 processes remote 
video signals received from the communication channel 82, 
and transmits processed source video signals 7 8 acquired 

3 0 from a local conferencing site over the communication 
channel 82. The audio board 102 processes remote audio 
signals received from a remote conferencing site, and 
processes source audio signals 80 for transmission over the 
communication channel 82 . 



9 



An alternative configuration of the peripheral 
housing 115 is illustrated in Figs. 14 and 15. A housing 
case 117 is generally rectangular in configuration with 
open end portions being adapted for respectively receiving 
a front housing bezel 119 and a back panel 123. The front 
housing bezel 119 preferably supports a built-in speaker 90 
and a built-in microphone 91. A plurality of video, audio, 
communication channel, and power connectors and adapters 
are preferably mounted to the back panel 123. A light 121, 
preferably mounted to the housing case 117, alerts a user 
of the audio/ visual communication system 70 when visual 
conferencing is in session. It is noted that the light 121 
may include an incandescent, fluorescent, light emitting 
diode, or other known light source. 

The transmission and receiving of local and 
remote audio and video signals over, the communication 
channel 82 is preferably facilitated by a communications 
board 106 provided within the peripheral, housing 115. The 
communications board 106, being coupled to the motherboard 
100 and communication channel transceivers mounted to the 
pack panel 123, communicates with the audio board 102 and 
video board 104 to respectively transmit and receive local 
and remote audio and video signals over the communication 
channel 82 . It" is noted that the communication channel 82 
is generally connected to a domestic or international 
communication network 84. It is further noted that the 
communication channel 82 may be of a conventional analog 
POTS (Plain Old Telephone Service) configuration or a 
digital communication channel, such as an ISDN (Integrated 
Services Digital Network) digital communication channel. 
The communications board 106 preferably includes a high- 
speed modem for transmitting audio, video, and other 
informational signals over an analog communication channel. 
Power for the audio/ visual communication system 70 is 
provided by a power supply 108 coupled to the motherboard 

10 



100. The power supply 108 is preferably a universal 
switching AC power supply capable of delivering a wide 
range of supply voltages preferably between 85 and 2 64 
volts and at frequencies ranging between 47 and 63 hertz to 
5 operate on both domestic and international power delivery 
systems. 

An important feature of the novel audio/visual 
communication system 70 concerns the high-speed output 
interface board 110 adapted for communicating with a 

10 separate host computer system 72. In one embodiment, local 
and remote video signals are communicated to the separate 
host computer system 72 over a SCSI (Small Computer Systems 
Interface) interface or bus. In another embodiment, the 
output interface board 110 comprises a PCMCIA (Personal 

15 Computer Memory Card Industry Association) interface or bus 
for providing high-speed communications between the 
audio/visual communication system 70 and various types of 
host computer systems 72, including lap-top computer 
systems. 

20 Turning now to Fig. 6, there is shown an 

input /output panel 12 0 comprising an output interface panel 
122, an input interface panel 150, and a communication 
channel interface panel 170. The input/output panel 120 is 
preferably configured and mounted as a back panel 123 at 

25 the rear of the peripheral housing 115, and provides means 
for coupling the internal audio, video, and data processing 
assemblies of the audio/visual communication system 70 to 
audio and video signal sources 80 and 78, communication 
channel 82, host computer- system 72, external speaker 22 0, 

30 and monitor 76. The input interface panel 150 preferably 
includes a video input panel 151 and an audio input panel 
160. The video input panel 151 preferably includes a main 
video input jack 152 and an auxiliary video input jack 154 
for receiving NTSC (National Television Systems Committee) 

35 or PAL (Phase Altering Line) source video signals 78. A 



video camera or camcorder is generally connected to the 
main video input jack 152, while a, video cassette recorder 
(VCR) is usually connected to the auxiliary video input 
jack 154, although a second video camera may be used 
5 instead of a VCR. The video input panel 151 further 

includes main and auxiliary S-video input jacks 156 and 158 
for receiving video signals from an S-video source 78. 

The audio input panel 160 preferably includes a 
main audio input jack 162 and an auxiliary audio input jack 

10 164, with both audio inputs 162 and 164 preferably being 
2.0 volt line level compatible inputs. The main audio 
input jack 162 is generally connected to the audio output 
of a video camera or camcorder, wherein the audio output 
signal produced by the video camcorder's build-in 

15 microphone is received by the audio input jack 162. The 
auxiliary audio input. jack 164 is usually connected to a 
VCR, but may alternatively be connected to the audio output 
of a second video camera or camcorder. An external 
microphone jack 168 is provided to receive audio input 

20 signals from an external microphone, and, when utilized, 

preferably disables the built-in microphone 91 provided on 
the peripheral housing 115. 

The output interface panel 122 preferably 
includes a video output panel 125, an audio output panel 

25 127, and a host computer system interface panel 140. 
Remote video signals received over the communication 
channel 82 may be displayed on an external monitor 76 by 
coupling the external monitor 76 to the video output jack 
124. The output video signal provided at the output video 

3 0 jack 124 preferably conforms to one of either of the 

composite NTSC or PAL video signal standards. An S-video 
output jack 12 6 is provided to permit connectivity to an S- 
video device. 

The audio output panel 127 preferably includes a 

35 local audio output jack 128 and a remote audio output jack 
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130. The local audio output jack 128 is adapted to 
communicate audio signals produced at the local 
conferencing site, while the remote audio output jack 130 
is adapted to communicate remote audio signals received 
5 from a remote conferencing site. The local and remote 

audio output jacks 128 and 13 0 are usually connected to the 
left and right audio inputs of a stereo VCR. An external 
speaker jack 132 provides means for broadcasting remote 
audio signals over an external speaker 220. It is noted 

10 that remote audio signals are preferably routed to an 
internal speaker 90 unless an external speaker 22 0 is 
connected to the external speaker jack 132, wherein the 
internal speaker 90 is preferably disabled. 

In one embodiment, as discussed previously, the 

15 audio/visual communication system 7 0 provides stand-alone 
video conferencing capability simply by connecting audio 
and video signal source devices 80 and 78 to the audio and 
video input jacks 162 and 152 of the input interface panel 
150, connecting an external monitor 76 to the video output 

20 jack 124, and connecting a standard or digital telephone 

line to the communication channel interface panel 170. In 
this configuration, the audio/visual communication system 
70 provides full-color, full-motion video conferencing over 
a communication channel 82, with remote video images being 

25 displayed on the monitor 76 and the remote audio signals 
being broadcast over the internal speaker 90. 

The functionality of the video conferencing 
system illustrated in Fig. 3 is generally enhanced by 
coupling the peripheral audio/visual communication system 

30 70 to a separate host computer system 72, and operating 
visual conferencing application software preferably 
configured to operate on, and in cooperation with, the host 
computer system 72. The host computer system 72 and 
accompanying visual conferencing application software 

35 preferably cooperates with the audio/ visual communication 
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system 70 to coordinate the presentation of remote and 
local video images on a display 74 coupled to the host 
computer system 72, and to generally enhance the 
functionality of the audio/visual communication system 70. 
The visual conferencing application software package 
accompanying the audio/visual communication system 70 is 
preferably adapted to operate on most popular personal 
computer systems, and in cooperation with a variety of 
industry standard operating systems. In one embodiment, 
the visual conferencing application software package is 
preferably operable on both Windows® and Macintosh® based 
computer systems. 

The output interface panel 122 includes a host 
computer system interface or interface panel 140 which • 
preferably comprises an SCSI interface and output connector 
142 and/or a PCMCIA interface and output connector 148. A 
second SCSI or PCMCIA interface and output connector 144 is 
preferably available for daisy-chaining the audio/visual 
communication system 70 to other SCSI or PCMCIA peripheral 
devices. An identification (ID) switch 146 is provided to 
uniquely identify the audio/visual communication system 70 
amongst other peripheral devices communicating over a 
common SCSI bus 113. The host computer system 72, when 
coupled to the host computer interface panel 140, 
communicates with the audio/visual communication system 70 
pjreferably by issuing one or more coordination instructions 
to configure and coordinate the operation of the 
audio/visual communication system 70. 

Connectivity between the audio/visual 
communication system 70 and the communication channel 82 is 
preferably provided by a communication channel interface 
panel 170. Communication over either an ISDN digital 
communication line or a standard analog POTS communication 
line is selectably provided by an ISDN transceiver 172 and 
a. POTS transceiver 174. Video conferencing over a standard 

14 



analog POTS line is facilitated by an internal modem 
preferably operable at a baud rate of up to 28,800 kilobits 
per second (kbit) . The ISDN transceiver 172 is preferably 
connected to a Basic Rate ISDN (BRI) digital communication 
5 channel 82 which provides two 64 kbit data channels, one 16 
kbit voice channel, and one 64 kbit signaling channel. The 
communication channel interface panel 170 may further 
include an optional channel transceiver 176 for 
communicating over a Primary Rate ISDN (PRI) communication 

10 channel, a Tl line, a Switch-56 line, and various local and 
wide area networks. 

Referring now to Fig. 5, there is shown a 
generalized block diagram of a video conferencing system 
comprising a peripheral audio/visual communication system 

15 70 and a separate host computer system 72. Source audio 
and video signals 80 and 78 are received by an input 
interface panel 150 comprising appropriate input connectors 
and jacks. The input interface panel 150 is preferably 
configured to receive source video signals 78 provided by 

20 at least two video sources, including a video camera and a 
VCR, or, alternatively, two video cameras, for example. 
The source audio signal 80, received from a built-in 
microphone 91 or an external microphone, is preferably 
transmitted to the local audio processor 182 of the audio 

25 board 102. The source video signal 78 is preferably 

transmitted to the local video processor 186 of the video 
board 104. 

The local audio processor 182 receives the source 
audio signal 80 preferably from the audio input panel 160 

30 and transmits the source audio signal 80 to a central 

controller 200. A communication channel interface panel 
170 receives the source audio signal 80 transmitted from 
the central controller 200, and, in turn, transmits the 
source audio signal 80 to the communication channel 82. In 

35 one embodiment, the local audio processor 182 converts the 



source audio signal 80 to a corresponding compressed audio 
signal of a predetermined compressed format for efficient 
transmission over the communication channel 82.. 

The source video signal 78, received by the video 
5 input panel 151, is transmitted to a local video processor 
186 provided on the video board 104. The local video 
processor 186 processes the source video signal 78 for 
transmission to the central controller 200, which, in turn, 
transmits the source video signal 78 to the communication 

10 channel interface panel 170 for transmission over the 
communication channel 82. In one embodiment, the local 
video processor 186 converts the source video signal 78 
received from the video input panel 151 to a corresponding 
compressed video signal of a predetermined compressed 

15 digital format for efficient transmission over the 

communication channel 82, It is noted that the source 
video signal 78 and associated .audio signal 80 are 
generally transmitted together as a combined, audio/video 
signal over the communication channel 82. The central 

20 controller 200 preferably synchronizes the associated audio 
and video signals 80 and 7 8 when producing a combined 
audio /video signal , 

Still referencing Fig. 5, a remote audio signal 
transmitted over the communication channel 82 is preferably 

25 received by the communication channel interface panel 170 
and communicated to the central controller 200. The 
central controller 2 00 transmits the remote audio signal to 
a remote audio processor 184 provided on the audio board 
102. The remote audio processor 184 includes means for 

3 0 converting a compressed remote audio signal to a 

corresponding remote decoded audio signal. The remote 
decoded audio signal is preferably transmitted to an 
internal speaker 90 provided on the peripheral housing 115. 
The remote decoded audio signal is also transmitted to the 

35 output interface panel 122 and, specifically, to the 



external speaker jack 127. Connecting an external speaker 
220 to the external speaker jack 127 for broadcasting of 
the remote decoded audio signal preferably disables the 
internal speaker 90. 
5 The communication channel interface panel 17 0 

also receives remote video signals from the communication 
channel 82, and transmits the remote video signals to the 
central controller 200 for processing by a remote video 
processor 188 provided on the video board 104.. The remote 

10 video processor 188 typically receives from the central 
controller 200 a compressed remote video signal which is 
preferably converted by the remote video processor 188 to a 
corresponding remote decoded video signal. The video board 
104 preferably transmits the remote decoded video signal to 

15 both the host computer interface panel 140 and the video 
output panel 125. An external monitor 76 may be connected 
to the video output panel 125 for displaying thereon a 
video image associated with the remote decoded video 
signal. The remote decoded video signal received by the 

2 0 host computer interface panel 140 is preferably transmitted 

to the SCSI interface 142 or PCMCIA interface 148 for 
communication to a separate host computer system 72 coupled 
to the audio/visual communication system 70. The host 
computer system 72 preferably coordinates presentation of a 
25 video image associated with the remote decoded video signal 
on a display 74 coupled to the host computer system 72. 

In one embodiment, the video board 104 transmits 
the source video signal 78 together with the remote decoded 
video signal to the host computer interface panel 140. A 

3 0 combined video signal corresponding to the combination of 

the source video signal and remote decoded video signal is 
preferably transmitted to the host computer system 72 for 
simultaneous presentation on the display 74. Video images 
associated with the source and remote decoded video signals 
35 may respectively be displayed as side-by-side video images. 



picture- in-picture images, or any other desired combination 
of source and remote video images on the display 74. The 
formatting and presentation of the source and remote video 
images is preferably controlled by the host computer system 
5 72, and more typically, by cooperation of the host computer 
system 72 and visual conferencing application software 
operating thereon. 

The audio and video processors 182, 184, 186, and 
188 of the audio and video boards 102 and 104 preferably 

10 comprise compression and decompression chips, also termed 
CODEC chips, that encode and decode audio and video signals 
in accordance with an internationally, recognized video 
conferencing standard. One suitable video conferencing 
standard is the CCITT H.320 standard promulgated by the 

15 Consultative Committee on International Telephony and 
Telegraphy, a body of the International Telegraph Union 
(ITU) established by the United Nations, The CCITT H.320 
video conferencing standard includes several sub-standards , 
including a video compression standard H.261, a channel 

20 coding standard H.221, and audio compression standards 
G.711, G.722, and G.728. 

In accordance with another embodiment, the 
audio/ visual communication system 70 includes a high-speed 
data pipe or data transfer capability for transferring data 

25 files over the communication channel 82. A standard data 
file residing on the host computer system 72, for example, 
is preferably transmitted to a data file processor 202 
through the host computer interface panel 140. The central 
controller 200 then receives the data file transmitted from 

30 the data file processor 202, and, in turn, transmits the 

data file to the communication channel interface panel 170. 
The communication channel interface panel 170 transmits the 
data file through an ISDN transceiver 172 when the 
audio/visual communication system 70- is communicating over 

35 an ISDN digital communication channel 82. A high-speed 
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modem (not shown) , preferably provided on the communication 
board 106, receives the data file from the central 
controller 200 prior to transmitting the data file over the 
POTS transceiver 174 of the communication channel interface 
panel 170. The data file processor 202 preferably 
comprises data compression means to convert a standard data 
file to a compressed data file of a predetermined 
compressed format for high-speed transmission over the 
communication channel 82 . • 

A remote data file may be received by the 
communication channel interface panel 170, transmitted to 
the central controller 200, and then communicated to the 
data file processor 202. A compressed remote data file is • 
preferably converted to a standard remote data file by the 
15 data file processor 202, and transmitted to the host 
computer system 72 through the host computer interface 
panel 140. A high-speed data file transfer link or data 
pipe between a local host computer system 72 and a remote 
host computer system (now shown) is thus facilitated by the 
data file processor 202 of the audio/visual communication 
system 70 . ~ 

The data file processor 2 02 preferably provides 
high speed, bi-directional data communication between two' 
host computer systems communicating over the communication 
25 channel 82. Various visual conferencing enhancement 

features including file transfer, screen sharing, document 
collaboration, and other data exchange features are 
provided by the data file processor 202 operating in 
conjunction with two host computer systems communicating 
over the communication channel 82. Coordination of the 
file transfer process is both simple and reliable. The 
local host computer system 72, for example, preferably 
transfers data blocks over the communication channel 82 
whenever a local SEND status is set to TRUE, thus 
35 indicating that the remote host computer system is 

19 
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available to receive the data blocks. • The local SEND 
status is. initially set to TRUE and then subsequently' set 
to FALSE after transmitting a first set of data blocks. 
The remote host computer system, after receiving the first 
5 set of data blocks from the communication channel 82, 

preferably transmits an OK SEND status signal to the local 
host computer system 12, but only after receiving the first 
set of local data blocks without error. Additional data 
blocks may then be transferred by the local host computer 

10 system 72 to the remote host computer system. 

A RECEIVE status signal indicates that incoming 
data blocks have been received from a remote conferencing 
site and are awaiting processing at the local conferencing 
site. The local host computer 72 will transmit an OK SEND 

15 status signal to the remote conferencing site after 

processing the incoming data blocks. If a data block is 
not properly communicated between a local host computer 
system 72 and a remote host computer system, the data block 
will automatically be re-transmitted. Accordingly, an . 

20 overrun condition or loss of data is precluded. The data 
file processor 2 02 preferably handles all handshaking, and 
error detection/correction procedures. The data file, 
processor 202 preferably further comprises double buffering 
means to ensure optimum utilization of the communication 

25 channel 82 transmission bandwidth. Double buffering 

permits the local host computer system 72, for example, to 
transmit a second set of data blocks to the data file 
processor 202 while the first set of data blocks is being 
transmitted over the communication channel 82. 

30 An important feature provided by the high-speed 

data pipe or data transfer capability of the audio/visual 
communication system 72 is the ability to view and modify a 
document simultaneously displayed at a local and remote 
conferencing site. Referring now to Figs. 11 and 12, the 

3 5 novel peripheral audio/visual conferencing system operates 

20 



in cooperation with a host computer system and visual 
conferencing application software to provide window sharing 
• and document collaboration functions which may be initiated 
at either a local or remote conferencing site. It is 
5 assumed that the operating systems of the local and remote 
host computer systems 244 and 2 64 are capable of operating 
• at least one software application within one of a plurality 
of activatable software application windows. 

A user of the local host computer system 244, for 

10 example, preferably initiates window sharing and document 
collaboration by first opening one or more local 
application windows at step 622. A local window menu 600 
is constructed at step 624 using as entry selections the 
names or designations of the previously opened window 

15 applications. The local window menu 600 is preferably 

updated at step 62 6 to include the names or designations of 
all subsequently opened application windows immediately 
before the local window menu 600 is pulled down into the 
foreground of the local display 248 by the user. The 

20 applications associated with each of the windows are 

preferably presented in alphabetical order for selection in 
the menu 600. 

The user, at step 628, then selects a local 
active application window 602 from the menu 600 for sharing 

25 with a remote conferencing site. The local host computer 
system 244, at step 630, preferably allocates an 
appropriate amount of system memory to accommodate a local 
off-screen window buffer 604 and a local pixel update table 
606. A copy of the pixels or pixel data defining the local 

30 active window 602 is transferred to the local off-screen 
window buffer at step 632. The local active window 
selected from the window menu 600 is then brought to the 
foreground of the local display 248 at step 634. All of 
the pixels comprising the video image within the local 

3 5 active window 602 presented in the foreground of the local 



display 248 are then copied to the local off-screen buffer 
604 at step 636 . 

The visual conferencing application software 
senses whether a local draw command has been issued at step 
5 638, typically by sensing the activity of a mouse or 
keyboard comprising the local user interface 246, for 
example. In response to the local draw command, the local 
active window 602 pixels affected by the local draw command 
are modified or updated at step 642. The modified pixel 

10 data within .the local active window 602 is recorded or 
updated in the local pixel update table 606 at step 640, 
typically in the form of pixel characteristic and location 
data. The updated pixel data is then copied to the local 
off -screen window buffer 604 at step 636, thus resulting in 

15 a mirror image of the local active window 602 being 

maintained in the locaL off-screen window buffer 604. In 
practice, original pixel data residing at particular memory 
locations in the off -screen window buffer 604 is generally 
overwritten or replaced by modified pixel data 

20 corresponding to the same memory locations. 

At an appropriate time, the pixel data residing 
in the local off-screen window buffer 604 is transferred to 
the local audio/visual communication system 242 for 
transmission over the communication channel 82 at step 644. 

25 It is noted that the transmission of local pixel data over 
the data pipe or communication channel 82 at step 644 may 
proceed subsequently to or concurrently with the processing 
steps 638, 642, 640, and 636 associated with modifications 
made to the local active window pixels. 

30 A remote host computer system 264 preferably 

operates visual conferencing application software 
substantially similar to that operating on the local host 
computer system 244 to enhance video conferencing between 
the local and remote conferencing sites. After 

3 5 establishing a communication link between the local and 

22 



remote host computer systems 244 and 264, a full update of 
pixel data associated with the video image of the entire 
local active window 602 as reflected in the local off- 
screen window buffer 604 is initially transmitted over the 
communication channel 82 and received by the remote 
audio/visual communication system 262 at step 650. The 
pixel data associated with the entire local active window 
602 is first copied to the remote off-screen window buffer 
610 at step 652, and subsequently transferred at step 654 
to the remote active window 608 presented in the foreground 
of the remote display 268. 

The modified local pixel data transmitted at step 
644 over the data pipe 82 is received at the remote 
conferencing site at step 650, then copied to the remote 
off -screen window buffer 610 at step 652, and subsequently 
transferred at step 654 to the remote active window 608 to 
update the video image presented therein. It is noted that 
a conferencing party at the remote conferencing site may 
also effect changes to the document or application 
currently shared within the local and remote active windows 
602 and 608. The remote host computer system 264 
preferably cooperates with visual conferencing application 
software to coordinate window sharing, modification, and 
updating in a manner similar to that previously discussed 
with respect to the local host computer system 244. 

During the window sharing and document 
collaboration procedures, a full update or transfer of all 
of the pixel data associated with either of the local or 
remote active windows 602 and 608 is generally performed 
when a conferencing party modifies the size of an active 
window, or upon the expiration of a predetermined amount of 
time programmed into an automatic timer for the purpos.e of 
periodically refreshing the local and remote active windows 
602 and 608. It is noted that the incremental updating of 
pixel information is performed primarily to optimize usage 



of the available transmission bandwidth of a limited 
bandwidth communication channel 82 . Other optimization 
schemes, such as converting the pixel data to a compressed 
format using one of a number of standard compression 
5 schemes, may be employed to increase the transfer 
efficiency of pixel data between a local and remote 
conferencing site. when performing window sharing and 
document collaboration functions. 

One important advantage of the novel audio/visual 

10 communication system 72 concerns its operability with a 
variety of host computer systems 7 0 and associated 
operating systems. The ability to communicate with 
virtually all popular host computer system platforms and 
operating systems significantly enhances the portability of 

15 the audio/visual communication system 70, and makes full- 
feature video conferencing readily available for most 
commercial, governmental and personal uses. Multiple 
platform and cross-platform (operation between two computer 
computers operating under dissimilar operating systems) 

20 video conferencing is preferably facilitated by visual 
conferencing application software operable on the host 
and/or remote computer systems. 

In general, the manner in which video data is 
processed by a given host computer system 72 differs from 

25 one computer system manufacturer to another. Since no 
single video processing standard has been adopted for 
exclusive use among the manufacturers of host computer 
systems, the novel audio/ visual communication system 70 
preferably performs substantially all of the significant 

3 0 video processing tasks prior to transferring the video data 
over an output interface 140 to an attached host computer 
system 72 for presentation on a display 74. Virtually all 
popular host computer systems 72 are generally configured 
to communicate over one of a limited number of - standard 

35 output interfaces 140, such as an SCSI 142 or PCMCIA 148 



interface, for example. The audio/visual communication 
system 70 provides processor-independent compatibility with 
virtually all popular host computer systems 72 by 
formatting video data into a form amenable for transmission 
5 over the standard output interface 140 and processing by a 
specific host computer system 72 coupled to the 
audio/visual communication system 70. 

A host computer system 72, in cooperation with 
visual conferencing application software operating thereon, 

10 preferably issues a variety of coordination instructions to 
the audio/visual communication system 70 to facilitate 
video conferencing between a local and remote conferencing 
site. The host computer system 72 preferably coordinates 
the transfer of video frame data between the audio/visual 

15 communication system 72 and the host computer system 72. 

The host computer system 72 preferably issues 
read and write request instructions to the audio/visual 
communication system 70 to coordinate the transfer of video 
data therefrom in a manner similar to that when 

2 0 communicating with other peripheral devices, such as a disk 

drive array, for example. In response to the read and 
write request instructions, the audio/visual communication 
system 70 transfers a requested number of video frames and 
other configuration parameters between the host computer 
25 system 72 and the audio/visual communication system 70. In 
accordance with this embodiment, the audio/visual 
communication system 70 operates in a slaved relationship 
with the host computer system 72, whereby all coordination 
instructions are produced by the host computer system 72, 

3 0 and responded to by the audio/visual communication system 

70. 

In one embodiment, the output interface 140 
comprises a SCSI interface 142, wherein communication 
between the host computer system 72 and the audio/visual 
35 communication system 70 conforms to one of a number of 

25 



standard SCSI communication protocols, such as SCSI-1 and 
SCSI-II protocols. The host computer system 72 preferably 
, produces coordination instructions or commands in the form 
of parameter blocks. Each parameter block typically 
5 includes an operation code or opcode field that specifies 
the particular operation to be executed, and also includes 
associated data and , configuration parameters used to 
perform the operation. The opcode field is generally 
included at the beginning of the parameter block followed 

10 by configuration parameters and data unique to the 

particular opcode. The configuration parameters typically 
specify the type of video data and manner in which the 
video data transferred from the audio/visual communication 
system 72 is to be presented on- the display 74. 

15 For example, as illustrated in Fig. 10, there is 

shown a generalized diagram of typical information 
contained in a read or write coordination instruction 500 
suitable for coordinating communications between the host 
computer system 72 and the audio/visual communication 

20 system 70 over the SCSI output interface 142. It is to be 
understood that peripheral communication protocols other 
than one conforming to the SCSI standard may be employed 
for effecting communications between the host computer 
system 72 and the audio/visual computer system 70 without 

25 departing from the scope of the present invention. By way 
of example, a PCMCIA interface 148 and associated 
communication protocol may be employed. 

By way of illustration and not of limitation, the 
coordination instruction 500 shown in Fig. 10 comprises an 

30 ID field 508, a logical unit number field 506, and a data 
block field 502 which includes a lead data byte field 504. 
Both read and write request instructions are preferably 
structured to include the informational fields ■ illustrated 
in Fig. 10. The ID field 508 provides an identification to 

35 distinguish the audio/ visual communication system 7 0 from 



other peripheral devices coiranunicating over the SCSI bus 
113 . The logical unit number field 506 preferably 
indicates the type of data being transferred when the host 
computer system 72 is reading or transferring the 
5 information from the audio/visual communication system 72 . 
The logical unit number field 506 is preferably implicated 
only during a read transfer operation, and is typically set 
to zero during write transfer operations. 

The type of data transferred during a write 

10 transfer operation is preferably indicated by the lead data 
byte field 504 of the data block field 502. The specific 
information or data being transferred during either a write 
or read transfer operation is preferably contained within 
the data block field 502. The number of bytes comprising 

15 the data block field 502 is preferably dependent on the 
specific type of read or write request instruction 
generated by the host computer system 72. For example, if 
the host computer system 72 issues a read request 
instruction to transfer video information from the 

2 0 audio/visual communication system 7 0 to the host computer , 

system 72, a predetermined fixed number of video data bytes 
are transferred in the data block field 502. When the host 
computer system 72 issues a read status request 
instruction, for example, a predetermined fixed number of 
25 data bytes associated with the status information is 
transferred in the data block field 502. By further 
example, when the host computer system 72 writes a block of 
data to the audio/visual communication system 70, the size 
of the data block 502 being transferred is obtained from 

3 0 reading the lead data byte field 504 within the data block 

502. Accordingly, each specific type of coordination 
instruction 500 has an associated predefined data block 
field 502 size. 

In further reference to Fig. 10, the video data 
35 contained in the data block field 502 is shown organized in 



an RGB format for a matrix of pixels, with each pixel 
corresponding to a single dot or point on a color 
television screen or monitor. Each pixel 512, in turn-, 
consists of red, green, and blue color components. The 
format of the pixel data of the data block 502 is 
preferably dependent on the particular CPU (Central 
Processing Unit) of the host computer system 72. The red, 
green, and blue color components are preferably quantified 
in three 5-bit fields 516, 518 and 520 for 

M.acintosh^/Motorola® based computer systems. In practice, 
16 bits, or two 8-bit bytes, of color component data are 
preferably associated with each individual pixel 512. As 
such, the extra most significant bit 514 is preferably set 
to zero. For host computer systems 72 employing a 
Windows /Intel based architecture, the color components of 
each pixel 532 are preferably quantified in the following 
field sequence 53 0: a 3-bit green field 536, a 5-bit blue 
field 538, a 1-bit unused field 540, a 5-bit red field 542, 
and a 2-bit green field 544. It can be appreciated that 
quantifying the three color components for each pixel 
configuration 512 and 532 in accordance with these 
preferred formats provides up to 32,678 (2^^) color 
combinations. It is to be understood that other host 
processors or CPUs can be employed in the host computer 
system 72 other than those discussed above, and that the * 
audio/visual communication system 72 can accommodate such 
other processors by formatting the pixel data associated 
with local and remote video signals in a form amenable for 
transmission over the output interface 140 and processing 
by the particular host computer system 72. 

It is noted that a video data block field 5 02 
generally consist of a repeating sequence of red, green, 
and blue data fields (not necessarily in this order) , with 
the maximum amount of pixel data comprising a single data 
block field 502 being limited by the predefined data block 
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5 02 size dictated by the specific read or write video 

transfer coordination instruction being executed by the 

host computer system 72. A typical coordination 

instruction associated with a picture-in-picture display 

command parameter block, for example, is represented in the 

following C language software code: 

OSErr VBoxHostPIP (long width, long height, long 
croppedX, long cropped Y, long top, long left, 
RGBColor borderColor short borderWidth) 
{ 

OSErr err; 
S_HOST_PIP op; 



op . opcode=OP_HOST_PlP; 
op . sizex^width; 

op. sizey=height ; 
op .cropx^croppedX; 
op . cropy=croppedY; 
op .posxFtop; 
op .posy=lef t ; 
op . color=0; 

op . color=borderColor . red»ll ; 
op . color<<=5 ; 

op . color+=borderColor . green»ll ; 
op . color<<=5 ; 

op . color+=borderColor . blue»ll ; 
op . width=borderWidth ; 

gHeader [ 0 ] =kSCSIWrite ; 
block header 
gHeader [1] =0 ; 

.gHeader [2] =sizeof (S_HOST_PIP) »16; 
gHeader [3 ] =sizeof (S_HOST_PIP) »8; 
gHeader [4] =sizeof(S_HOST_PIP) ; 
gHeader [5] =0; 



// parameter block 

// opcode 
// specific 
parameters 



// SCSI command 
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gBlock[0] .scOpcode=scNoInc; // SCSI command 

gBlock[0] . scParaml= (long) Scop; 

gBlock[0] . scParam2=sizeof ) S_HOST_PIP) ; 

gBlock[l] , scOpcode=scStop; 

gBlock[l] .scParaml=0; 

gBlock[l] .scParam2=0; 

err=VBoxWaitBusy() ; 

if (err == no Err) 

return (SendWriteCommandToSCSK&gHeader , gBlock) ); 
else 
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return err; 

The software code for other coordination instructions 
structured in parameter block form is preferably similar to 
that presented for illustrative purposes above, for a 
picture-in-picture display instruction. 

The coordination instructions associated with the 
transfer of data file information over the communication 
channel 82 to facilitate window sharing and document 
collaboration are preferably structured in a parameter 
block command format similar to that illustrated in Fig. 
10. An operation code or opcode and block length code are 
typically the first two parameters of a data transfer 
coordination instruction. These two parameters preferably 
define the structure and content of the entire 
data transfer parameter block command. The opcode for a 
draw or display coordination instruction, for example, will 
typically include a display opcode, a. block length code 
defining the total number of bytes comprising the parameter 
block command, and a plurality of pixel data indicating the 
screen location and characteristics of each pixel. 

To facilitate data transfer across different host 
computer platforms, the pixel data may include additional 
data in the form of embedded commands that instruct a 
particular CPU to process the pixel data in a particular 
manner. When two similar CPUs are communicating data 
across the communication channel 82, the imbedded command 
may be ignored or, preferably, not included in the pixel 
data. When cross-platform transferring of data is desired, 
the receiving CPU preferably extracts the imbedded commands 
and processes the associated pixel data as instructed. An 
embedded command, for example, may instruct the receiving 
CPU to swap the position of specific color bits within the 
block of pixel data as previously discussed with regard to 
Fig 10. 
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In another embodiment, as illustrated in Fig. 7, 
a local audio/visual communication system 242 and a remote 
audio/ visual communication system 2 62 are shown 
communicating over a communication channel 82 . The local 
5 and remote audio/visual communication system 242 and 2 62 
preferably operate in conformance with an industry 
recognized international communications standard, such as 
the CCITT H.32 0 standard. The local audio/ visual 
communication system 242 receives and processes local 
10 source audio and video signals 252 and 254 from a local 
conferencing site 240. The local source audio and video 
source signals 252 and 254 are then transmitted to a 
communication channel 82 . The local source audio and video 
signals 252 and 254 transmitted over the communication 
15 channel 82 are received and processed by the remote 

audio/visual communication system 262 situated at a remote 
conferencing site 260. 

Remote source audio and video signals 272 and 274 
are received and processed by the remote audio/visual 
20 communication system 262, transmitted over the 

communication channel 82, and received and processed by the 
local audio/visual communication system 242. A local 
monitor 281 may be coupled to the local audio/ visual 
communication system 242 to display remote video images 
25 received from the remote conferencing site 260. Similarly, 
a remote monitor 2 83 may be coupled to the remote 
audio/ visual communication system 262 to display local 
video images at the remote conferencing site 260. In 
accordance with this embodiment, full-color, . full-motion 
3 0 stand-alone video conferencing is provided by the local and 
remote audio/visual communication systems 242 and 2 62 
respectively coupled to local and remote monitors 281 and 
283, and to the communication channel 82. 

In accordance with another embodiment, each of 
35 the local and- remote audio/visual communication systems 242 



and 262 is respectively coupled to a local and host 
computer system 244 and 2 64 by a local and remote output 
interface 250 and 270. The local and remote host computer 
systems 244 and 264 respectively coordinate the transfer of 
5 video information between the local and remote audio/visual 
communication systems 242 and 262 for presentation on the 
local and remote . displays 248 and 268. Additionally, the 
local and host computer systems 244 and 264 preferably 
configure, adjust, and modify various operational 

10 parameters of the local and remote audio/visual 

communication systems 242 and 262, such as color, contrast, 
and brightness characteristics of processed video signals, 
volume settings of the internal or external speakers 90 and 
220, and connection to either an ISDN or POTS communication 

15 channel 82, for example. 

In accordance with the system configuration 
illustrated in Fig. 7, a visual conferencing application 
software package is preferably operable on each of the 
local and host computer systems 244 and 264. The visual 

20 conferencing application software package preferably 

enhances and expands the functionality of the audio/visual 
communication systems 242 and 262 in a manner previously 
discussed and further discussed hereinafter. The local 
host computer 244, for example, preferably issues one or 

25 more coordination instructions to the local audio/visual 
communication system 242 to orchestrate the transfer of 
video data received from the communication channel 82 by 
the local audio/visual communication system 242 for 
presentation on the local display 248. Further, the local 

3 0 host computer 244, in cooperation with the visual 

conferencing application software operating on the local 
host computer 244, preferably issues one or more 
coordination instructions to effect the transfer of data 
files between the local audio/visual communication system 

35 242 and the remote audio/visual communication system 2 62 



and remote host computer system 264, It is to be 
understood that the features and functionality discussed 
with reference to the local audio/visual communication 
system 242 are equally applicable to the remote 
5 audio/visual communication system 262. 

Another important feature provided by the 
cooperation of the local host computer 244 and the visual 
conferencing application software operating thereon, 
includes the capability to adjust or modify the effective 

10 transmission bandwidth of the communication channel 82 to 
enhance or degrade the relative transmission rates of 
audio, video, and data information transmitted over the 
communication channel 82. In one embodiment, the frequency 
bandwidth of the communication channel 82 allocated between 

15 video, audio, and data file sources is preferably 

modifiable by the host computer system 72 operating visual 
conferencing application software. The communication 
channel 82 transmission bandwidth is preferably adjustable 
in finite increments in compliance with the CCITT H.32 0 

20 standards specification. Any of the video, audio, and data 
file sources may be degraded or eliminated in order to 
enhance the relative performance of the other sources. 
Adjustment of the communication channel 82 transmission 
bandwidth allocated for audio, video, and data signal 

25 transmission is preferably made within constraints imposed 
by the CCITT H.320 communication standard, or other 
internationally recognized standard, to ensure that full- 
motion video conferencing is maintained. 

Another important feature realized through the 

30 cooperation of the local audio/visual communication system 
242 and the local host computer system 242 and associated 
visual conferencing application software is the capability 
to simultaneously display on either one of the local and 
remote displays 248 and 268 video images transmitted from 

35 both the local and remote conferencing sites 240 and 260. 
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The local audio/visual communication system 242, as later 
discussed in detail with reference to Fig. 8, receives and 
buffers both the local source video signal 254 and the 
remote source video signal 274, and produces a video signal 
5 representative of both local and remote video signals. The 
local host computer 244 then transmits the combined local 
and remote video signal for display on the local display 
248. 

A generalized flow diagram illustrating some of 

10 the more important coordination functions performed by the 
video conferencing application software is provided in Fig. 
13 . The video conferencing application software is 
preferably operated on both the local and remote host 
computer systems 244 and 264 to advantageously enhance the 

15 operation of the local and remote .audio/visual 

communication systems 242 and 262. A conferencing party 
typically initiates video conferencing by executing the 
visual conferencing application software at step 702. The 
user is preferably presented with various options by the 

20 presentation of a main menu at step 704, including options 
to configure the system, initiate visual conferencing, and 
to terminate the visual conferencing session. 

A conferencing party, at step 70 6, has the option 
to modify various parameters affecting the configuration 

25 and operation of the local audio/visual communication 

system 242 and the local host computer system 244. At step 
708, a user may select and modify various parameters 
affecting the communication between the local audio/visual 
communication system 242 and the communication channel 82. 

3 0 The user, for example, may specify whether visual 

conferencing will be established over a standard analog 
(POTS) or digital (ISDN) communication channel 82. An 
automated phone book database, for example, may be 
established and accessed to assist a user when establishing 

35 a communication link with a conferencing party. Other 
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communication features may be controlled at step 708, such 
as call logging which results in the recording of data 
associated with incoming, outgoing, or all communications 
made between the local audio/visual communication system 
5 242 and the communication channel 82. 

A conferencing party, at step 710, may select and 
modify various video features and settings affecting both 
the local monitor 281, coupled to the local audio/visual 
communication system 242, and the local display 248, 

10 coupled to the local host computer system 244. A liser may, 
for example, selectably decide to view video images 
associated with a local source video signal 254 received 
from either a main or an auxiliary video source 152 or 154. 
A picture-in-picture display mode may be enabled or 

15 disabled as desired. The color, brightness, contrast, and 
tint characteristics of the local source video signal 254 
may be adjusted, preferably through on-screen controls 
adjustable by a conferencing party via a local user 
interface 246. Further, the user may configure the local 

2 0 audio/visual communication system 242 at step 710 to 

display video images acquired from the local conferencing 
site, remote conferencing site, or both conferencing sites. 
Various configuration parameters affecting the 'local and 
remote audio may be modified at step 712. The gain of the 
25 audio input of a main camera or auxiliary camera, for 
example, may be modified by a conferencing party. The 
volume level of an internal or external speaker 90 and 220 
may also be controlled. 

The operational characteristics of the data pipe 

3 0 82 and data transfer capability of the local audio/ visual 

communication system 242 may be modified by a user at step 
714. The transferring of data files over the communication 
channel 82 may, for example, be given a high priority 
relative to video and audio data transmission, resulting in 
35 the continuous transfer of data bytes comprising the data- 
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files over the communication channel 82. Alternatively, 
the data file transfer process may be given a lower 
priority, resulting in the selective transmission of data 
bytes in the background to ensure optimum transmission of 
5 video and audio data through the communication channel 82 . 
A conferencing party may initiate visual 
conferencing from the main menu at steps 704 and 715, and 
establish a communication link between the' local 
conferencing site 240 and remote conferencing site 2 60 at 

10 step 718, typically by placing a call over the 

communication channel 82 to the remote conferencing site 
260. During visual conferencing, a user has the option to 
modify the system configuration at step 722 and, at step 
728, may modify various configuration and operational 

15 parameters discussed previously with respect to the 

configuration system option at step 706. A user may also 
modify the manner in which data files are transferred over 
the data pipe 82 at step 72 6, including, for example, 
options to modify the priority of data file transmission 

2 0 relative to video and audio signal transmission, and to 

encrypt and/or compress the data file prior to 
transmission. Visual conferencing between two conferencing 
sites 240 and 260 may continue until a conferencing party 
decides to terminate the video conferencing session, 

25 typically by exiting the visual conferencing software 

application at step 720 or from the main menu at step 704. 

During a video conferencing session between a 
local and remote conferencing site 240 and 2 60, a 
conferencing party may wish to initiate window sharing or 

30 document collaboration at step 724. An option to share a 
window presented either in color or black and white is 
preferably provided to ensure reliable and continuous 
window sharing and/or document collaboration over a limited 
bandwidth communication channel 82. Conferencing parties, 

3 5 for example, may initially share a window presented in 
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color, and subsequently switch to a black and white 
presentation at step 73 0 if a degradation in picture 
quality, typically resulting from a reduction in the rate 
of data transmission over the data pipe 82, is experienced. 
5 A user may select a particular window for sharing, or 
choose to stop or start window sharing or document 
collaboration as desired at step 73 0. The features and 
functions discussed hereinabove with regard to the video 
conferencing application software operating in cooperation 

10 with a host computer system are provided for illustrative 
purposes only, and exemplify the. enhanced functionality of 
a local and remote audio/visual communication system 240 
and 262 when respectively coupled to local and remote host 
computer systems 244 and 264. 

15 Another important feature concerns a novel video 

conferencing software routine or application operable on 
either or both of the local and remote host computer 
systems 244 and 2 54 for monitoring incoming communications 
received over the communication channel 82 . An incoming 

20 communication sensing software application, operable on the 
local host computer 244, for example, preferably monitors 
signal traffic over the local output interface 250. The 
sensing software routine preferably operates independently 
of any other application software operating on the local 

25 host computer 244, and is preferably both inaccessible to 
and imperceptible by a user of the local host computer 244. 
As such, the sensing software routine operates in the 
background and continuously monitors for incoming 
communications by polling the local output interface 250 on 

3 0 a regular basis, once every second, for example. Upon 
sensing an incoming communication, the sensing software 
routine preferably initiates or executes an alerting 
software application or routine which preferably alerts the 
user of the local audio/visual communication system 242 to 
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the incoming communication received over the communication 
channel 82 . 

The alerting software routine preferably 
interrupts the current operation of the visual conferencing 
application software or any other application software 
currently operating on the local host computer 244, and 
presents the user with a plurality of options, including an 
option to answer or ignore the incoming communication. The 
local host computer 244, in response to a user's decision 
to answer the incoming communication, preferably issues an 
answer coordination instruction to the local audio/visual 
communication system 242, and executes a visual 
conferencing application software routine to . receive and 
respond to the incoming communication. 

The user of the local audio/visual communication 
system 242 preferably interfaces with the local host 
computer 244 through a local user interface 246 coupled to 
the local host computer system 244. The local user 
interface 246 is preferably a graphical user interface • 
which, in cooperation with a mouse and keyboard coupled to 
the local host computer system 244, provides a user with 
means for communicating coordination instructions between 
the local host computer system 244 and the local 
audio/visual communication system 242. It is noted that 
graphical user interfaces, such as those developed for 
Windows® and Macintosh® based computer* systems, are 
commonly used to control or operate a host computer system. 
Those skilled in the art may create new or modify existing 
graphical user interface programs to include functionality 
for controlling the operation of the local host computer 
244 and, in cooperation with the visual conferencing 
application software, the local audio/visual communication 
system 242 . 

Figs. 8 and 9 illustrate in block diagram form 
the various audio and video processing components 
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comprising the audio and video boards 102 and 104 
illustrated in Figs. 4 and 5. An important advantage of 
the video board 104 comprising the audio/visual 
communication system 72 concerns the automatic detection 
5 and processing of video signals produced by either an NTSC 
or a PAL video source. The video processing block diagram 
3 00 shown, in Fig. 8 includes a central controller 200 which 
coordinates transmission and reception of video signals 
communicated over the communication channel 82. Local 

10 video signals produced at the local conferencing site 240 
by either an NTSC or a PAL video camera are preferably 
received by the main video input jack 152 and/or the 
auxiliary video input jack 154. 

The NTSC standard prescribes a video frame rate 

15 of thirty video frames per second, while the PAL standard 
specifies a video frame rate of twenty- five video frames 
per second, to maintain full-motion video. A single frame 
of motion video typically comprises an even field and an 
odd f ield. The NTSC /PAL decoder 3 02 preferably converts a 

20 local NTSC or PAL video signal to corresponding local 
decoded video image or pixel data at the output of the 
NTSC/PAL decoder 302. Automatic, detection and 
determination of the video signal format is performed by 
the NTSC/PAL decoder 3 02 as it processes the header 

25 information and other constituent data comprising a PAL and 
an NTSC video signal. 

The decoded local video pixel data is typically 
of an RGB (Red, Blue, Green) or YUV (luminance Y, and color 
difference signals U and V) video format. The NTSC/PAL 

3 0 decoder 3 02 preferably decodes local NTSC video signals to 
corresponding CIF240 resolution image data (352 x 244) , and 
local PAL video signals to corresponding CIF resolution 
image data (3 52 x 2 88) . A CIF240 resolution image is 
recognized as a standard image format for domestic video 
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display devices, while. a GIF (and QCIF) resolution image is 
recognized as an international standard image format. 

When processing an even field of a local NTSC or 
PAL video frame, the NTSC/PAL decoder 302 preferably 
5 transmits to the local video encoder 3 04 1 ocal CIF240 or 
GIF resolution image data depending on whether the local 
video source device is an NTSC or PAL camera. The local 
video encoder 3 04 preferably includes scaling circuitry 
that scales a local GIF240 resolution image to an 

10 appropriate GIF resolution image. When processing an odd 
field of the local NTSG or PAL video frame, the NTSG/PAL 
decoder 3 02 preferably transmits the previously decoded 
even field GIF image data or pixels to the local frame 
buffer 3 06 while simultaneously decoding the next odd field 

15 of the local video frame. This decoding scheme is 

preferably repeated for subsequent local NTSG or PAL video 
frames received by the NTSG/PAL decoder 302. The local 
video encoder 3 04 preferably comprises circuitry to convert 
local decoded video image data, typically in YUV format, to 

2 0 corresponding compressed local video image data. A 
suitable local video encoder 3 04 is model AV4310A 
manuf actured by AT&T, and a suitable NTSG/PAL decoder 3 02 
is model SAA7194 manufactured by Philips. 

The local frame buffer 3 06 preferably comprises' 

2 5 DRAM memory (Dynamic Random Access Memory) sufficient to 

temporarily store or buffer the data for 256,000 local 
pixels, which is sufficient to buffer two local GIF. images. 
Thus, the local frame buffer 3 06 provides double buffering 
of the local GIF video image data which, together with the 

3 0 line buffer 314 disposed between the local video encoder 

3 04 and the local frame buffer 3 06, enhances cooperation 
and processing between the NTSG/PAL decoder 302, local 
video encoder 3 04, and local frame buffer 3 06. The 
reconstructed, local GIF image data buffered in the local 
3 5 frame buffer 3 06 may then be routed to one or both of the 



output interface 140 and/or the video output jack 124 as 
local decoded video signals. The local decoded video image 
data processed by the NTSC/PAL decoder 3 02 and local video 
encoder 3 04 is also preferably transmitted to the central 
5 controller 2 00 for communication over the communication 
channel 82 . 

Remote video image signals transmitted over the 
communication channel 82 are preferably received at the 
central controller 200 and transferred to a remote video 

10 decoder 318. The remote video signals are converted by the 
video decoder 318 to corresponding remote decoded video 
frame data. The reconstructed frame data is buffered in a 
line buffer 324 and temporarily stored in a remote frame 
buffer 320. In a manner similar to that previously 

15 described with respect to the NTSC/PAL decoder 302 and 

local video encoder 304, even and odd fields of the remote 
video frame data are successively decoded and reconstructed 
into remote decoded video frame data, and buffered by 
cooperative processing between the remote video decoder 318 

20 and the remote frame buffer 320. The remote decoded video 
frame data is then routed to one or both of the output 
, interface 140 and/or the video output jack 124 as remote 
decoded video signals. 

Another important advantage provided by the 

25 audio/ visual communication system 70 concerns the 

simultaneous displaying of decoded local and remote video 
images on a video monitor 76 or a display 74 coupled to a 
separate host computer 72 . In one embodiment illustrated 
in Fig. 8, an output multiplexer 308 receives decoded local 

3 0 and remote video frame data respectively from the local 

frame buffer 306 and remote frame buffer 320. The combined 
decoded local and remote video frame data is buffered in a 
line buffer 310 for output to the separate host computer 72 
over the host computer output interface 140. The line 

35 buffer 310 is provided preferably to enhance the 



transmission of the decoded local and remote video frame 
data between the output multiplexer 3 08 and the host 
computer output interface 140. 

The combined local and remote video frame data 
5 may then be presented as various combinations of associated 
local arid remote video images on the display 74 coupled to 
the host computer system 72. The host computer system 72 
preferably issues one or more coordination instructions to 
control the routing of the decoded local and remote video 

10 image data between the host computer output interface 140 
and the video output jack 124. The output multiplexer 308 
may, for example, be instructed to transmit only the remote 
video frame data to the host computer output interface 14 0, 
rather than the local video frame data or the combined 

15 local and remote, video frame data. 

The decoded local and remote video frame data may 
also be routed to the video output jack 124 through the 
display multiplexer 330. The display multiplexer 33 0 
preferably controls the video frame data transmitted to the 

20 NTSC/PAL encoder 332 and video output jack 124. Local, 

remote, or combined local and remote video frame data may 
be transmitted to the NTSC/PAL encoder 332 through the 
display multiplexer 330. The video frame data transmitted 
by the display multiplexer 330 are converted to an NTSC or 

25 PAL format by the NTSC/PAL encoder 332 for communication to 
the video output jack 124 for eventual display on an NTSC 
or PAL monitor 76 coupled thereto. 

The NTSC/PAL encoder 332 is preferably configured 
to receive 704 pixels from the display multiplexer 33 0 

3 0 corresponding to a CIF video image residing respectively in 
each of the local and remote frame buffers 306 and 320 (352 
local pixels and 352 remote pixels provide 704 total 
pixels) . If it is desirable to display only the local or 
remote video images on the monitor 76 coupled to the video 

35 output jack 124, the display multiplexer 330 preferably 

42 




performs l-to-2 upscaling (3 52 x 2) by latching and holding 
the 3 52 local or remote pixels prior to being transferred 
to the NTSC/PAL encoder 332. If side-by-side presentation 
of the local and remote video image is desirable, the 
5 display multiplexer 33 0 respectively transfers all 352 

pixels from each of the local and remote frame buffers 3 06 
and 320 to the NTSC/PAL encoder 332. A suitable display 
multiplexer 33 0 is model XC3 03 0 manufactured by Xilink, and 
a suitable NTSC/PAL encoder 332 is model SAA179 

10 manufactured by Philips. 

Turning to Fig. 9, there is shovm a block diagram 
of the various system components 400 that process local 
audio signals acquired from a local conferencing site and 
remote audio signals received over the communication 

15 channel 82 . Local audio signals are preferably input to an 
audio processor 406 from a plurality of audio sources. 
Main and auxiliary input jacks 164 and 162 are respectively 
provided on the peripheral housing 115 of the audio/visual 
communication system 70 for receiving local audio signals 

2 0 typically from the audio outputs of a video camera or 
camcorder. The audio processor 406 may further receive 
local audio signals from an internal microphone 91, and 
external microphone coupled to the external microphone jack 
168, or a standard telephone microphone 440 coupled to a 

25 converter 410 that converts the audio telephonic signal to 
an appropriate analog audio signal. In one configuration, 
an input multiplexer 408, with its output coupled to the 
audio processor 406, provides means for combining local 
audio signals produced by the microphone 91 and the 

30 telephone 440, The audio processor 406 preferably includes 
analog-to-digital converter for converting local analog 
audio signals to corresponding local digital audio signals. 

The local audio signals received by the audio 
processor 406 are transmitted to and processed by the audio 

35 encoder 404. The audio encoder preferably comprises audio 

43 



/ 



coding circuitry to convert local digital audio signals to 
corresponding local compressed audio signals. The local 
digital or compressed audio signals are then transmitted to. 
the central controller 200 which coordinates transmission 
5 of the local audio signals over the communication channel 
82. A suitable audio encoder 404 is model DSP3210 
manufactured by AT&T. 

Remote audio signals transmitted over the 
communication channel 82 are preferably received by the 

10 central controller 2 00 and transmitted to an audio decoder 
402. The remote audio signals are typically serial 
compressed audio signals conforming to one of a number of 
industry standard compression formats. The audio decoder 
402 preferably converts the remote compressed audio signals 

15 to corresponding remote digital audio signals. 

Additionally, the audio decoder 402 preferably comprises 
echo cancellation circuitry to filter out a local, audio 
signal that is transmitted from a local conferencing site, 
received at a remote conferencing site, and then re- 

20 transmitted back to the local conferencing site and 
received again by the audio decoder 402. 

The audio processor 406, preferably including a 
digital-to-analog converter, converts remote digital audio 
signals to corresponding remote analog audio signals for 

25 transmission to a plurality of audio outputs, including a 
local audio output jack 128, a remote audio output jack 
130, an external speaker jack 132, and an amplifier 414 
coupled to an internal speaker 90. A suitable audio 
decoder 402 is model DSP3210 manufactured by AT&T. It is 

30 noted that the central controller 200 cooperates with audio 
and video processing components 400 and 300 and the 
communication channel 82 to maintain optimum audio/visual 
communication system 7 0 operation. 

It will, of course, be understood that various 

3 5 modifications and additions can be made to the preferred 
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embodiments discussed hereinabove without departing from 
the scope or spirit of the present invention. Accordingly, 
the scope of the present invention should not be limited by 
the particular embodiments discussed above, but should be 
5 defined only by the claims set forth below and equivalents 
thereof . 
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